Sanskrit as a Programming Language and Natural Language Processing
نویسنده
چکیده
In this paper represents the work toward developing a dependency parser for Sanskrit language and also represents the efforts in developing a NLU(Natural Language Understanding) and NLP(Natural Language Processing) systems. Here, we use ashtadhayayi (a book of Sanskrit grammar) to implement this idea. We use this concept because the Sanskrit is an unambiguous language. In this paper, we are presenting our work towards building a dependency parser for Sanskrit language that uses deterministic finite automata(DFA) for morphological analysis and 'utsarga apavaada' approach for relation analysis. The importance of astadhayayi is it provide a grammatical framework which is general enough to analyze other language as well therefore it is uses for language analysis. Keyword: Panani Ashtadhayayi, Vibhakti, Karaka, NLP, Sandhi.
منابع مشابه
Fuzzy Modeling and Natural Language Processing for Panini's Sanskrit Grammar
Indian languages have long history in World Natural languages. Panini was the first to define Grammar for Sanskrit language with about 4000 rules in fifth century. These rules contain uncertainty information. It is not possible to Computer processing of Sanskrit language with uncertain information. In this paper, fuzzy logic and fuzzy reasoning are proposed to deal to eliminate uncertain inform...
متن کاملLexicon-directed Segmentation and Tagging of Sanskrit
We propose a methodology for Sanskrit processing by computer. The first layer of this software, which analyses the linear structure of a Sanskrit sentence as a set of possible interpretations under sandhi analysis, is operational. Each interpretation proposes a segmentation of the sentence as a list of tagged segments. The method, which is lexicon directed, is complete if the given (stem forms)...
متن کاملAn Approach for Grammatical Constructs of Sanskrit Language using Morpheme and Parts- of-Speech Tagging by Sanskrit Corpus
Sanskrit since many thousands of years has been the oriental language of India. It is the base for most of the Indian Languages. Statistical processing of Natural Language is based on corpora (singular corpus). Collection of texts of the written and spoken words is known as Language corpus, which is collected in an organized way, in electronic media for the purpose of linguistic research. It pr...
متن کاملBook Reviews: Natural Language Processing: A Paninian Perspective
This book is an elementary introduction to natural language processing, but it is a very unusual one for two reasons. First, its subject languages are those of India; second, although the book sets out and discusses the better-known linguistic grammars that are commonly used in natural language processing, the grammar that the book actually uses is one based on the Astfidhy~y~ (pronotmced 'asht...
متن کاملEtranS- A Complete Framework for English To Sanskrit Machine Translation
Machine Translation has been a topic of research from the past many years. Many methods and techniques have been proposed and developed. However, quality of translation has always been a matter of concern. In this paper, we outline a target language generation mechanism with the help of language English-Sanskrit language pair using rule based machine translation technique [1]. Rule Based Machin...
متن کامل